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RELATED APPLICATION 
This application claims the benefit of U.S. Provisional Patent 
Application Serial No. 60/404,451 filed August 19, 2002, entitled "RXINSIGHT 
SECURITY PROCEDURES" . The entire disclosure of which is incorporated herein 
by reference. 

BACKGROUND OF THE INVENTION 

Technical Field The present invention relates to security techniques 
for customer accessible databases. 

Background Art. Pharmacies and wholesale pharmaceutical outlets 
have engaged various tools to evaluate their market share and monitor other key 
performance indicators in their national or local operating areas. One approach is to 
access company data, e.g., dispensed prescriptions, cash pricing, managed care 
contract rates, and generic dispensing ratios, and to compare that to industry standard 
market measures. For example, a pharmacy chain might determine their own 
company's prescription dispensing growth is 3% in the Philadelphia market, while the 
overall prescription volume is growing at 8%, and therefore determine that the chain 
is losing market share. 

In general, such pharmaceutical outlets treat their own market 
performance information to be a highly protected trade secret. Accordingly, there 
exists a need for a technique which provides a secure environment to provide 
information services to these chains, in a cost-effective manner, while protecting the 
confidentiality of these data, and reducing opportunity for error or corruption in the 
data. 

SUMMARY OF THE INVENTION 
An object of the present invention is to provide a technique for 
permitting pharmaceutical outlets to evaluate their market share in a secure 
environment. 
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Another object of the present invention is to provide information 
services to pharmaceutical outlets in a cost-effective, confidential accurate manner. 

In order to meet these and other objects of the present invention which 
will become apparent with reference to further disclosure set forth below, the present 

5 invention provides a database security access methodology, combined with a software 
application and sophisticated reference files and/or database files which enable 
pharmacy outlets to access confidential pharmaceutical information related to their 
respective companies, from one single database which houses all chains' data. 

The technique is cost-effective since the cost of populating, updating, 

10 and maintaining this information, housed on a central platform, can be spread across 
all pharmacy chains subscribing to the database. The chains do not have to incur the 
costs of purchasing a hardware platform and managing it themselves. There is no 
time delay from when the current month's information is available, and the chains' 
receipt of the data, and they can log on and access it immediately upon completion of 

15 the data load at the data warehouse. There is no delay in transmitting or shipping the 
data to a chain's data center and then waiting for the data load to occur there. 
Additionally, the data warehouse does not have to generate multiple large data marts 
for each chain that desires access to these data, thus resulting in additional cost 
savings, minimizing risk of data corruption in reproduction, and facilitating 

20 maintenance and updates to the data by only having one data warehouse to update 
versus many. 

A system and method for evaluating the performance of a first at least 
one of a plurality of outlets is provided. The system includes a computer system 
configured to execute a data access application, wherein the data access application 

25 includes a plurality of user accounts, wherein each of the plurality of user accounts 
includes a user group, and wherein a second at least one of the plurailty of outlets is 
associated with the user group. The system also including a data storage device 
configured to store market measures from a portion of the plurality of outlets and 
industry-standard market measures, wherein the data access application allows a user 

30 to access the industry-standard market measures and data associated with the outlets 
associated with the user group of user's user account. 

In another advantageous embodiment of the present invention, a 
system and method for evaluating the performance of a first at least one of a plurality 
of outlets is provided. The system includes a computer system configured to execute 
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a data access application, wherein the data access application includes a plurality of 
user accounts, wherein each of the plurality of user accounts includes a user group, 
and wherein a second at least one of the plurailty of outlets is associated with the user 
group. The system also including a data storage device configured to store market 

5 measures from a portion of the plurality of outlets and industry-standard market 
measures, wherein the data access application allows a user to access the industry- 
standard market measures and data associated with the outlets associated with the user 
group of user's user account. 

The accompanying drawings, which are incorporated and constitute 

10 part of this disclosure, illustrate preferred embodiments of the invention and serve to 
explain the principles of the invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a functional diagram of an exemplary method in accordance 
1 5 with the present invention; 

FIG. 2 is a block diagram showing an arrangement of sales outlets and 
a processing station illustrative of an embodiment of the invention; 

FIG. 3 is a block diagram of a processing system that may be used as 
the central station of FIG. 2; 
20 FIG. 4 is a flow chart showing the estimation of prescription sales at an 

outlet that is illustrative of the invention; 

FIG. 5 is a flow chart showing the determination of sampled and 
unsampled outlets of FIG. 4 in greater detail; 

FIG. 6 is a flow chart showing the selection of the group of sampled 
25 outlets for each unsampled outlet in greater detail; 

FIG. 7 is a flow chart showing one arrangement for the estimation of 
prescriptions sales for a prescribing physician at a plurality of pharmacies illustrative 
of the invention; 

FIG. 8 is a flow chart showing the confidence signal operation of FIG. 
30 7 in greater detail; and 

FIG. 9 is a flow chart showing another arrangement for the estimation 
of prescription sales for a prescribing physician at a plurality of pharmacies 
illustrative of the invention. 
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Throughout the drawings, the same reference numerals and characters, 
unless otherwise stated, are used to denote like features, elements, components, or 
portions of the illustrated embodiments. Moreover, while the present invention will 
now be described in detail with reference to the FIGS., it is done so in connection 
5 with the illustrative embodiments. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

FIG. 1 illustrates an exemplary embodiment of a system and method 
for permitting retail outlets or a chain of retail outlets to evaluate their performance in 
10 a particular market in a secure environment. Generally, the exemplary system and 
method deliver proprietary sales data and market sales data to a user, thus allowing 
the user to appreciate the performance of the user's retail outlet or chain of retail 
outlets. Specifically, the exemplary methods and systems deliver proprietary 
pharmaceutical chain data and pharmaceutical market data to a user, thus allowing the 
15 user to appreciate the performance of the user's pharmaceutical retail outlet or chain 
of pharmaceutical retail outlets. 

FIG. 1 illustrates a block diagram of a pharmaceutical data analysis 
system 10. The data analysis system 10 includes a communications network 16, the 
central server 120, a chain database 30, a projected database 40 and a store master 
20 database 50. The pharmaceutical data analysis systme 10 allows a user 12, 14 to 
upload market measures for a current month and receive information pertaining to 
pharmaceutical retail outlet, a chain of pharmaceutical retail outlets, and/or a portion 
of the chain of pharmaceutical retail outlets. 

In a preferred embodiment, the communications network 16 is the 
25 Internet. In another preferred embodiment, the communications network 16 is a 
private network. 

The user 12 utilizes the central station 120 by using his or her user 
account and logging into a data access application 22 through the communications 
network 16. The central station 120 is a server, which includes a transceiver 26. Data 
30 may be transmitted or received by the central station 120 through the transceiver 26. 
The data access application 22 and a database security application 24 run on the 
central station 120. The user may use the data access application 22 and indirectly the 
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database security application 24 to upload market measures for a current month, 
receive information, generate reports, and the like. 

In a preferred embodiment, multiple transceivers may be utilized. In 
another preferred embodiment, the transceiver 26 is a network interface card. 
5 The users 12, 14 upload market measures for a particular month on a 

monthly basis. The data access application 22 and the database security application 
24 receive the market measures from the users 12, 14 and populate the chain database 
30, the projected database 40 and the store master database 50 with market measures 
for a current month. Each of the chain database 30, the projected database 40 and the 

10 store master database 50 are Oracle-based relational databases. The chain database 30 
houses market measures including all pertinent transactions submitted to the system 
10 from a particular retail establishment's retail transaction systems, and are stored in 
a manner which enables each retail establishment's data to be identifiable and 
retrievable. For example, information identifying the particular pharmacy may be 

15 included. The retail establishment can be a single establishment or a chain of 
establishments. 

The projected database 40 houses aggregated, projected, industry- 
standard market measures. The technique for storing and generating the aggregated, 
projected, industry-standard market measures is described hereinbelow with reference 

20 to Figs. 2-9. The projected database 40 provides the benchmark market data for 
retail establishments, chains of retail establishments or portions of chains of retail 
establishments to compare their performance to that which is occurring in an overall 
relevant marketplace. 

The store master database 50 is also updated on a monthly basis. The 

25 store master database 50 is a proprietary reference database, also stored in an Oracle- 
based relational data table, which contains the relationships between each retail store 
in a geographic area, its local operating company name called the Organization Name, 
and its Parent company. Preferably, the geographic area is a country, for example: the 
United States. The store master database 50 contains information describing the 

30 many diverse relationships that exist as large corporate entities continue to acquire 
local retail chains and independent stores offering the product or products being 
tracked, while maintaining the local operating business name. An example of data 
stored in the store master database 50 follows as Table I: 
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Table I 



Store Name/Address 


Organization Name 


Parent Name 


ABC DRUGS 


Cutter Supply, Inc. 


Diamond Valu Worldwide 


Jewel's Crystal Mart 


Colorful Savings Co. 


Diamond Valu Worldwide 


Lucky 5 s Save-a-Lot 


Carrot-Top Enterprises 


Diamond Valu Worldwide 


Outlet A 


Discount Supplies 


Corporation A 


Outlet B 


Discount Supplies 


Corporation A 


Sunny' s Selections 


Clarity Surplus 


Diamond Valu Worldwide 



In order to provide secure access to the databases 30, 40, 50, they are 
5 accessed by users 12, 14 at the retail chain headquarters via a commercially available 
web-based decision support tool: the data access application 22. Since it is critical 
that each user 12, 14 have access to only a portion of the data contained in the 
database 30 and complete access to the data contained in the projected database 40, 
several layers of security are required. 
10 In a preferred embodiment, the data access application 22 is provided 

by Business Objects. 

Each user 12, 14 has an account including a username and password. 
Each account is also associated with at least one parent name, for example a user 
account may be associated with the parent name "Diamond Valu Worldwide." Upon 

1 5 establishing the new user account, the user is given access to all data corresponding to 
a particular parent entity by default. If a user's account is associated with Diamond 
Valu Worldwide, the user may access data associated with each of the stores that are 
also associated with Diamond Valu Worldwide, i.e., ABC Drugs, Jewel's Crystal 
Mart, Lucky' s Save-A-Lot, and Sunny's Selections. The user would not be able to 

20 access data corresponding to Outlet A and Outlet B, because that data is associated 
with the parent "Corporation A." The user's access may be restricted to a portion of 
the data corresponding to the parent entity. A user account may be given access to 
data corresponding to all stores associated with a parent entity, all stores associated 
with a particular organization of a parent entity, a particular store or another group of 

25 stores. Using the example above, Diamond Valu Worldwide may create a new user 
account for Jewel's Crystal Mart's manager to see only data pertaining to Jewel's 
Crystal Mart. 

In order to realize these differing levels of security, the database 
security application 24 utilizes Oracle security procedures. A "user group" is 
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developed, associating specific rows in the database to a given parent name, 
organization name or other subset of retail establishments. A user account associated 
with a particular user group is only able to access the rows associated with the user 
group. If the store master database 50 included the data as described by Table I, a 

5 Diamond user group could be developed based upon the parent name "Diamond Valu 
Worldwide." A user account associated with the Diamond user group would be able 
to access the rows associated with ABC Drugs, Jewel's Crystal Mart, Lucky' s Save-a- 
lot and Sunny' s Selections, but not the rows associated with Outlet A and Outlet B. It 
should be noted that all user groups are provided access to the entire projected 

10 database 40. 

In a preferred embodiment, a user group may be associated with more 
than one parent name. 

A level of security is realized using the data access application 22. 
Using security procedures of the data access application 22, a new user account may 
15 be established which enables access to the data access application 22 interface. The 
new user account is associated with a user group, which is developed as described 
above. In the example above, a new user account for "John Smith" could be 
associated with the "Diamond" user group. 

A further level of security is established at the data element level. 
20 Each parent entity or organization, may submit a chain attribute data file. The chain 
attribute data file is merged into the chain database 30 and/or the projected database 
40. This file contains chain-specific attributes, from which customized reports and 
analytic measures may be created. For example, a parent entity or organization could 
send data to the store master database 50, which would group stores into districts or 
25 regions, in order to generate relevant reports for their district or regional managers. 
The chain-specific attributes are also linked into the database via the parent name and 
the user group security noted above. These data are not visible to any user outside of 
the retail chain's user group. 

FIG. 2 depicts an arrangement illustrating a first embodiment of the 
30 invention in which product sales at unsampled sales outlets are estimated. In FIG. 2, 
there are shown an area 100, sampled sales outlets 110-1, 110-2, 110-3, 110-N-l and 
110-N and unsampled sales outlets 110-4 and 110-5 in the area 100 and a central 
station 120. Each of sampled outlets 110-1, 110-2, 110-3, 110-N-l and 110-N may 
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preferably be coupled via a line of lines 130-1 through 130-N to the central station 
120. 

In FIG. 2, the outlets may be pharmacies or other type of retail stores 
or distribution establishments all of which distribute a particular product. The outlets 
5 are at various locations in the area 100. While there are 7 outlets shown in FIG. 2 for 
purposes of illustration, it is to be understood that there are generally hundreds or 
thousands of outlets which are not restricted to a given area. The location of each 
outlet is generally known in terms of latitude and longitude from available census data 
or in terms of zip code centroids from Post Office data. Accordingly, the distances 
10 between pharmacies can be determined. Product sales data generated at each outlet S n 
(e.g., 110-1) is preferably transferred to the central station 120 via a line (e.g., 130-1). 
Unsampled outlets U n (e.g., 110-4 or 110-5) in the area 100 are not coupled to the 
central station or, if coupled, do not supply valid sales data so that only an estimate of 
the sales volume of the particular product can be made. 
15 A single area 100 is shown in FIG. 2 for purposes of illustration only. 

According to the invention, the estimation of sales activity at an unsampled outlet U is 
formed on the basis of the sales activity at the sampled outlets S in a neighborhood of 
the unsampled outlet U. The neighborhood of an unsampled outlet U may be defined 
as the N closest sampled outlets S which is different for each unsampled outlet and 
20 : not as a predefined geographic area. In an urban area, the neighborhood of closest 
sampled outlets may all be located within a short distance of the unsampled outlet. In 
a rural area, the neighborhood of N closest sampled outlets nay spread over distances. 
Consequently, each unsampled outlet has its own neighborhood area which varies 
according to the distances to the nearest N sampled outlets. Advantageously, the 
25 correlation of sales activity data is not restricted to a predetermined geographic area 
as in the prior art. 

FIG. 3 depicts a block diagram of the central station 120 of FIG. 2 
which includes an input/output unit 201, a characteristics and location store 205, a 
sales data store 210, a processor 215, a program store 220, a bus 225, a transfer store 
30 228 and work station processors 230-1 through 230-N. Input/output 201 is coupled to 
bus 225 and is also coupled to the sampled outlet lines 130-1 to 130-N and to output 
line 245. The characteristics and location store 205, the prescription data store 210 
and the program store 225 are coupled to the bus 225. The Processor 215 is coupled to 
the bus 220, the transfer store 228 and is also coupled to work station processors via a 
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control line 240. The transfer store 228 is coupled to work station processors 230-1 to 
230-N and the work station processors are coupled together through a network 250. 
Characteristics signals stored in store 205 may include signals representing the type of 
outlet or the total sales of all products at the outlet. This type of data is available from 
5 sources such as Drug Distribution Data (DDD ®) available from IMS America, 
Plymouth Meeting, Pa. DDD ® is a trademark of IMS America. Program store 220 
stores instruction signals that control the operation of the processor 215 and provide 
parameter signals to determine the operation of work station processors 230-1 through 
230-N through the processor 215 and control line 240. 
10 The sales outlet data received by input/output 201 from sales outlets 

which may exceed 2 x 10 9 records each having between 88 and 1000 bytes is 
transferred to data store 210. In view of the large amount of data to be processed, the 
processing is divided between the processor 215 and the work station processors 230- 
1 through 230-N. Information signal arrays produced in the processor 215 are 
15 transferred to work station processors 230-1 through 230-N through the transfer store 
228. Each information signal array from the processor 215 placed in the transfer store 
228 is divided into N portions. A preassigned portion of the information signal array 
in the transfer store is supplied to each of work station processors 230-1 through 230- 
N and the processing of the portions in the work stations 230-1 through 230-N is 
20 controlled by signals from the processor 215 via the control line 240. After the 
processing of the information signal array portions in the work station processors, the 
processed information signal array portions are merged into one processed 
information signal array which is returned to transfer store 228 from the work station 
processors. The returned information signal array in the transfer store 228 is then 
25 further processed in the processor 215 to produce estimate sales volume results. The 
operation of the system of FIG. 3 will be further described in connection with the 
estimation arrangements shown in FIGS. 4-8. 

FIG. 4 depicts a flow chart illustrating the operation of the central 
station 120 of FIG. 2 in estimating the volume of sales of a particular product at an 
30 unsampled outlet such as outlet 04 or outlet 05 in FIG. 2. The operations depicted in 
the flow chart of FIG. 4 are performed by processor 215 and work station processors 
230-1 to 230-N of FIG. 3 under control of instruction signals from in the program 
store 220. In the flow chart of FIG. 4, product data from outlets Ol, 02, 03, ON-1 
and ON are transferred to the input/output unit 201 preferably via corresponding lines 
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130-1 through 130-N in step 301. Transferred data is stored in outlet sales data store 
210. A data transfer from an outlet may occur for each sales transaction or may 
include a number of transactions for a prescribed period of time. At preset intervals, 
the sales data is sent to processor 215 and therein is evaluated in step 3 10 to determine 
5 the sampled outlets Si and the unsampled outlets Up in the processor 215. Unsampled 
outlets may include outlets transferring data evaluated as invalid. 

FIG. 5 shows a method of determining sampled and unsampled outlets 
in greater detail. Referring to FIG. 5, an outlet index n for the outlets Ol, 02, . . . , 
ON is set to 1 in step 401. A sampled outlet index i and an unsampled outlet index p 
10 are set to 1 in steps 405 and 410. The sales data for the particular product from each 
outlet On is checked in decision step 415 to determine if the data is valid (i.e., meets 
predetermined criteria). If the data is judged to be valid in step 415, the outlet On is 
classified as a sampled outlet Si in step 420 and the index i is incremented in step 425. 
When no data is available for the outlet On or the data is not accepted as valid in step 
15 415, the outlet On is classified as an unsampled outlet Up in step 422 and the index p 
is incremented in step 427. The index n is then incremented in step 430. Until index n 
is greater than N for the last outlet ON, step 415 is reentered from decision step 435. 
When all of the outlets Ol through ON have been classified as sampled outlets and 
unsampled outlets, the last value of index p (pmax) and the last value of index i 
20 (imax) representing the number of unsampled outlets and the number of sampled 
outlets are stored in data store 210 (step 440). 

As shown in FIG. 2, there are five sampled outlets 110-1 (Ol), 110-2 
(02), 110-3 (03), 110-N-l (ON-1) and 110-N (ON) which are designated SI, S2, S3, 
S4 and S5 from the processing of FIG. 5 and two unsampled outlets 110-4 (04) and 
25 110-5 (05) which are designated as Ul and U2 from processing of FIG. 5. Unsampled 
outlet Ul is located in the central portion of the area 100 and is surrounded by 
sampled outlets SI through S5. Unsampled outlet U2 is located at one edge of the area 
100, is closest to sampled outlet S5 and most remote from sampled outlet SI. Priorly 
known techniques based an estimate of the sales volume of a product at an unsampled 
30 outlet on the sales volume of the product for the geographic area. Since the sales 
outlets have different characteristics (e.g., size and location) and have sales related to 
outlets outside a particular area, estimates based on the overall sales volume in a 
particular area as in the prior art are biased. In accordance with the invention, an 
estimate of sales volume of a particular product at a sales outlet is based on the known 
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sales volume of other outlets according to the distances between the sales outlet and 
the other outlets and the particular characteristics of the outlets independent of any 
geographic area. By using the outlet characteristics and the distances, an unbiased and 
more accurate estimate may be determined. 

Signals corresponding to the distances between unsampled outlet Ul 
and sampled outlets SI through S5 and the distances between outlet U2 and sampled 
outlets SI through S5 are then formed in step 330. In step 335, the mmax closest 
sampled outlets to unsampled outlet p are selected. The selection is performed in the 
processor 215. mmax may be chosen according to the total number of sampled 
outlets. The selection of sampled outlets associated with each unsampled outlet is 
shown in greater detail in FIG. 6. 

With reference to FIG. 6, a set of distance signals dip* for sorting is 
generated in step 501 corresponding to the distance signals dip generated in step 330. 
The unsampled outlet index p is set to one in step 505. A selected outlet index m is set 
to one in step 510 and the sampled outlet index i is set to one in step 515. In step 520, 
a signal D is set to LPN (largest possible number) and the loop including steps 525, 
530, 535 and 540 is entered to find the smallest distance of the distances dip 1 . 

In decision step 525, the signal dip' representing the distance from 
sampled outlet Si and unsampled outlet Up is compared to D. When dip 1 is less than 
D, D is set to dip', Rm representing a tentative selected outlet is set to Si, the index i* 
is set to i and a tentative selected distance signal dmp is set to dip 1 in step 530. Step 
535 is then entered in which sampled outlet index i is incremented. Where dip 1 is not 
less than D, step 535 is entered directly from decision step 525. Decision step 540 is 
then entered. Until sampled outlet index i exceeds imax in step 540, step 525 is 
reentered to compare the next distance signal dip' to the last determined minimum 
distance signal. When i exceeds imax, the minimum of the selected sampled outlets is 
chosen as Rm. The minimum distance signal dip 1 is then set to LPN in step 545 to 
exclude Si* from comparison in step 525 and the selected outlet index m is 
incremented in step 550. 

Step 515 is reentered from step 555 until mmax closest outlets for 
unsampled outlet p are selected and another outlet Rm is chosen in the loop from step 
525 through 540. Upon selection of mmax sampled outlets, the unsampled outlet 
index p is incremented in step 560 and step 510 is reentered via decision step 565 so 
that a set of m sampled outlets may be selected for the next unsampled outlet Up in 
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FIG. 6 via decision step 565. When p is greater than pmax, control is passed to step 
340 in FIG. 4. 

In step 340, index p is set to one. A weighting factor w m is then 
determined for each selected sampled outlet Rm of unsampled outlet Up in step 345. 
Weighting factor generation is performed in the processor 215. The weighting factor 
is an inverse function of the distance between the sampled outlet Rm and the 
unsampled outlet Up and the characteristics of the sampled and unsampled outlets 
according to 

W m ^{(l/dRmUpVCSCTin /d*nUp q )}*Tup (1) 

where dRmUp is the distance between sampled outlet S m and unsampled outlet 
Up, the summation is over all sampled outlets for m=l to mmax, T Up is the unsampled 
outlet characteristic (e.g., total sales volume for all products), T m is the sampled outlet 
characteristic and q is greater than zero, q may, for example, be 2. Index p is 
incremented in step 348 and control is passed to step 345 until p is greater than pmax 
in decision step 350. ■ 

The weighting factor signals for unsampled outlets Up and the product 
data for the outlets are read into the transfer store 228 as a data array which is divided 
therein into N data array portions. The processor 215 sends control signals to work 
station processors 230-1 through 230-N to initiate processing of the data file portions 
in the work station processor. Each work station processor then proceeds to form a 
product estimate signal for the data file portion assigned to it as indicated with respect 
to the entire data file in steps 355 through 370 in FIG. 4. In step 355-1, a starting 
value of the unsampled outlet index p=l is set. The loop from step 360-1 to step 370-1 
is then entered. The estimated sales of the particular product is then generated for a 
range of unsampled stores Up in step 360 according to 

Est (Vup) = Sw m V m (2) 
where V m is the sales volume of the particular product at sampled outlet m and 
the summation is over the sampled outlets from m = 1 to m = mmax. The unsampled 
outlet index p is incremented in step 365-1 and control is passed back to step 360-1 
via decision step 370-1 until p is greater than the maximum of the range processed in 
work station processor 230-1 and an estimate of sales volume for all unsampled 
outlets in the range has been formed. The processing of the other work station 
processors 230-2 through 230-N is the same as described with respect to the work 
station processor 230-1 except that the range is determined by the portion of the data 
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file sent to the work station. The processing in the work station processor 230-N is 
shown in the steps 355-N through 370-N. 

For purposes of illustration with respect to FIG. 2, the number of 
selected sampled outlets mmax is chosen as 3. It is to be understood, however, that 
5 other values may be chosen. For example, if there are 50 or more sampled outlets, 
mmax=10 is a suitable value. In FIG. 2, sampled outlets Ol, 02 and 03 are selected 
as the three closest sampled outlets Rl, R2 and R3 to unsampled outlet Ul. To 
illustrate the invention, assume that the distance d R iui from sampled outlet Rl to 
unsampled outlet Ul is 0.4 miles, the distance diaui between sampled outlet R2 and 
10 unsampled outlet Ul is 0.2 miles and the distance diuui between sampled outlet R3 
and unsampled outlet Ul is 0.6 miles. Further assume that the total sales volume for 
all products at sampled outlets Rl, R2, R3 and Ul are $3,000, $2,000, $5,000 and 
$4,000, respectively. The weighting factor for sampled outlet R2 is then 

w 2 ={(l/0.2) 2 /(2000/(0.2) 2 +3000/(0.4) 2 +5000/(0.6) 2 )}*4000 (3) 
15 w 2 =1.210084 

Similarly, w, =0.302521 and w 3 =0.13445377. For a sales volume of the particular 
product at Rl, R2 andR3 of 5, 20 and 4, respectively, the estimated sales volume of the 
particular product at unsampled outlet Ul is 

Est(Vui)=Wl *VR1 +W 2 Vr2 +W 3 Vr3 W 

20 Est(V u i)=26.252 

The product volume signals for the sampled outlets Si is then formed in step 372-1 
through 372-N and the total estimated sales volume of the product for unsampled and the 
sampled outlets is then formed for each range in the work station processors 230-1 
through 230-N in steps 375. The resulting unsampled outlet estimate and total volume 
estimate signals of the processing in the work station processors is then merged in and 
totaled step 380 into a result data file. The result data file is transferred to transfer store 
228 and therefrom to data store 210. The results are then sent to output line 245 of the 
input/output 201. 

FIG. 7 shows a flow chart illustrating estimation of the distribution of a 
controlled product by a control authority. More particularly, FIG. 7 shows the 
operation of the arrangement of FIGS. 2 and 3 in estimating sales of a prescription 
product for a prescribing physician at the pharmacies. The operations in the flow chart 
of FIG. 7 are performed by processor 215 and work station processors 230-1 through 
230-N of FIG. 3 under control of corresponding instruction signals stored in the 
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program store 220. Referring to FIG. 7, location data, data of type characteristics of 
pharmacy outlets Ol through ON in FIG. 2 and physician identification data are 
stored in the characteristics and locations store 205 of FIG. 3 in step 601. Prescription 
data is transferred from pharmacies Ol, 02, 03, ON-1 and ON and is stored in 
5 pharmacy outlet data store 210 according to the prescribing physicians (step 603). 

At prescribed intervals, the total sales for the prescribing physician j of 
a particular prescription product is estimated in steps 605 through 670 of FIG. 7. In 
step 605, the processor 215 operates to determine the sampled pharmacy outlets Sij 
and the unsampled pharmacy outlets Up for a particular prescription product 
10 according to the validity and volume of the transferred prescription data of the 
prescribing physician j. The arrangement shown in FIG. 5 may be used in the 
determination of step 605. As described with respect to FIGS. 2 and 3, pharmacy 
outlets Ol, 02, 03, ON-1 and ON can be determined as sampled outlets Slj, S2j, S3j, 
S4j and S5j where the sampled data is validated. Outlets 04 and OS are classified as 
1 5 unsampled outlets Ul and U2. 

Step 610 is entered from step 605 and signals representative of the 
distances dip between each sampled outlet Sij and each unsampled outlet Up are 
generated in the processor 215. After the distance determination of step 610, the set of 
nearest sampled pharmacy outlets Rmj for each unsampled pharmacy outlet Up is 
20 selected by processor 215 according to step 615. Selection of sampled pharmacies 
may be performed as described with respect to FIG. 6. Then, the unsampled pharmacy 
outlet index p is set to 1 in step 620 and a weighting factor signal wm for the each 
sampled pharmacy outlet Rm (m=l to mmax) is generated in loop from step 625 to 
step 634. Each weighting factor signal is formed according to 
25 w m ^ai/d^up^WRn, /dRmUp^Tup (5) 

where q is greater than 0, Tup is the total sales volume of all products at pharmacy outlet 
Up, Trhi is the total sales volume of all products at pharmacy outlet Rm and the 
summation S is from m=l to mmax. 

After the weighting factor signals have been formed for the last 
30 unsampled pharmacy in the loop from step 625 to 634, a data array including the 
weighting information and the sales data from store 210 is formed by the processor 
215 and sent to transfer store 228. The data array is divided into N portions each of 
which is processed by one of work station processors 230-1 through 230-N to form a 
signal representing an estimate of the total prescription product sales volume for 
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physician j. The operations of the work station processors are controlled by the 
processor 215 through the control line 245. Each work station processor operates to 
process a predetermined range of the data array in the transfer store 228. 

The work station processor 230-1 operates according to steps 

5 638-1 through 655-1 to form the volume product signals for physician j in the range 
p=l to p=pl. In step 640-1, the prescription estimate signal for unsampled outlets Upj 
is formed for each unsampled pharmacy in the range from p=l to p=pl and the sales 
volume signal for the sampled pharmacies in this range is determined by the work 
station processor 230-1 according to step 655-1. Work station processor 230-N 

10 operates in similar manner for physician j over the range p=pN to p=pmax as 
indicated in FIG. 7 according to steps 638-N through 655-N. Signals are transferred 
from one work station processor to another as required for the operation of the one 
work station processor through the network 255. The results of the operation of work 
station processors 230-1 through 230-N are merged in step 660 and an estimate of the 

1 5 prescription product volume 

Vx^Vsij+SzVjUp ( 6 ) 
is generated in step 665 where 

V jUp = 2 3 {{(l/d^UpVCSs (TRm /dRmUp^} *T Up {V^j (7) 

Ei is the summation over all sampled outlets, S2 is the summation over all unsampled 
20 outlets, S 3 is the summation over all sampled outlets in the neighborhood of unsampled 
outlet Up. At this time, a confidence signal that estimates the degree of possible error of 
the total volume V-q of the product prescriptions of the prescribing physician j is then 
generated in step 670. 

FIG. 8 shows the confidence signal generation operation of step 
25 670 in greater detail. Referring to FIG. 8, a mean squared error signal MSE is first 
generated by bootstrapping on the basis of the sales data from the sampled pharmacy 
outlets SI through Simax in step 701. The bootstrapping method is well known in the 
art and is described in "The Jacknife, The Bootstrap, and Other Resampling Plans" by 
B. Efron, Society for Industrial and Applied Mathematics (SIAM) Publications, 
30 Philadelphia 1982. 

In the bootstrapping, subsets of pharmacy outlets are selected 
and the prescribing physician's prescription volume is estimated therefrom. The 
variances of the "bootstrapped" estimates closely approximates the true variance. A 
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generalized variance function (GVF) is derived from the MSEs generated in step 701 
of the form 

log(SQRT(MSE))=a-fblog (Tj)+clog(Nj)+dlog(N Sj ) (8) 
where SQRT is the square root, a, b, c and d are regression coefficients, Tj is the 

5 estimated total of the prescription product prescribed by physician j, Nj is the total of 
prescription products prescribed by physician j and dispensed at the sampled pharmacy 
outlets and Nsj is the number of sampled pharmacy outlets with prescription product 
sales for physician j. The generalized variance function is described in "Introduction to 
Variance Estimation" by K. M. Wolter, Springer-Verlag, New York 1985. 

10 The values of Tj, Nj and NSj are determined in steps 710, 715 and 720 

from the prescription data in store 210 of FIG. 3 in processor 215. Regression 
coefficient signals a, b, c and d are generated by multiple regression techniques well 
known in the art and a log(SQRT(MSE)) value for the physician j is determined in 
step 720. Decision step 725 is then entered in which the value log(SQRT(MSE)) is 

15 compared to Kl. If log(SQRT(MSE)) is less than Kl, a low estimated error signal is 
produced in step 730. The value log(SQRTQMSE)) is then compared to K2>K1 in step 
735 to produce a medium estimated error signal in step 740 if log(SQRT(MSE)) is 
less than K2. Where log(SQRT(MSE)) is not less than K2, a high estimated error 
signal is generated in step 745. While three values for the estimated error signal are 

20 determined in the flow chart of FIG. 8, it is to be understood that any number of 
values such of 5 may be used. 

The flow chart of FIG. 9 illustrates another arrangement for estimating 
the prescription product sales volume of a prescribing physician. According to the 
arrangement of FIG. 9, a group of sampled pharmacies is selected for each unsampled 

25 pharmacy and a weighting factor for each sampled pharmacy in the neighborhood of 
one of the unsampled pharmacies is generated as in FIG. 7. The weighting factors for 
the sampled pharmacies are combined with the actual sales data for the sampled 
pharmacies according to 

V Tj = Ei Vsij £3 V sij (9) 

30 which corresponds to 

VTj = 2iVsij[l+£jW ip ] (10) 
where V S ij is the prescription product sales volume for physician j at pharmacy i, Wi P is 
the weighting factor for a sampled pharmacy i in the selected neighborhood of 
unsampled pharmacy p, Si is the summation over all sampled pharmacies and £2 is the 
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summation over all unsampled pharmacies, £3 is the summation of all sampled 
pharmacies in the neighborhood of unsampled pharmacy p and S4 is the summation of 
weighting factors associated with sampled pharmacy i. The resulting estimate of sales 
volume for the prescribing physician is similar to that described with respect to FIG. 7 

5 but the efficiency of the estimate generation is improved 

In the method of FIG. 9, the sampled pharmacies in the neighborhood 
of each sampled pharmacy are first determined and the weighting factor signals w ip 
for the neighborhood sampled pharmacies are formed in the processor 215 of FIG. 3 
on the basis of the pharmacy location, the characteristics data and the physician 

10 identification data in store 205. The sales data from the sampled pharmacies and the 
weighting factor signals Wj p are transferred to the transfer store 228, divided therein 
into prescribed ranges and each range of data is supplied to an assigned one of the 
work station processor 230-1 through 230-N. In each work station processor, the 
estimated sales volume for each sampled pharmacy outlet in range processed by the 

15 work station processor is generated from the sales data and the projection factor for 
the sampled pharmacy outlet. The total estimated sales volume for the sampled 
pharmacy outlets over range of the work station processor is then generated. When the 
work station processing is completed the resulting estimated sales volumes from the 
work station processors are merged and transferred via the transfer store 228 to the 

20 processor 215. 

Referring to FIG. 9, the locations, type characteristics of pharmacy 
outlets Ol through ON in FIG. 2 and physician identification data are stored in the 
characteristics and locations store 205 of FIG. 3 in step 801. Prescription data is 
received from the pharmacies Ol, 02, 03, ON-1 and ON and is stored in data store 

25 210 according to the prescribing physicians (step 803). 

The total sales for the prescribing physician j of a particular 
prescription product is estimated in steps 805 through 870. In step 805, the processor 
215 operates to identify the sampled pharmacy outlets Sij and the unsampled 
pharmacy outlets Up for a particular prescription product according to the validity and 

30 volume of the transferred prescription data of the prescribing physician j. The 
arrangement shown in FIG. 5 may be used. As described with respect to FIGS. 2 and 
4, pharmacy outlets Ol, 02, 03, ON-1 and ON can be determined as sampled outlets 
Slj, S2j, S3j, S4j and S5j where the sampled data is validated. Outlets 04 and OS are 
classified as unsampled outlets Ul and U2. 
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Step 810 is entered from step 805 and signals representative of the 
distances dip between each sampled outlet Sij and each unsampled outlet Up are 
generated in the processor 215. After the distance determination of step 810, the set of 
nearest sampled pharmacy outlets Rmj for each unsampled pharmacy outlet Up is 
5 selected by processor 215 according to step 815. The selection of sampled pharmacies 
Rmj may be performed as previously described with respect to FIG. 6. 

The unsampled pharmacy index p is reset to one in* step 820 and 
weighting signals wmp are formed for the selected sampled pharmacies Rmi in the 
loop from step 825 to 834. In step 825, the weighting signal 

10 W^ -{(l/dRmUpVCSCTn, /dR^} *Tup (1 1) 

is formed for each sampled pharmacy m selected as in the neighborhood of sampled 
pharmacy Up. After the weighting signals for all unsampled pharmacies are formed in 
the processor 215, the sales data stored in store 210 and the weighting signals are 
transferred to transfer store 228 wherein the data and weighting signal array is divided 
15 into N portions and each portion is transferred to one of work station processors 230-1 
through 230-N. The processing of the portion transferred to work station processor 230-1 
is shown from step 838-1 through 855-1 in FIG. 9 and the processing the portion 
transferred to work station processor 230-N is indicated from step 838-1 to 838-N. 

With respect to the processing of the range of sampled pharmacies in 
20 work station processor 230-1, the sampled pharmacy index i is set to one in step 838-1 
and the loop from step 840-1 through 850-1 is iterated to form the estimated 
prescription volume V'ij for the sampled pharmacies in the range from i=l to i=il. In 
each iteration, an estimated volume signal is generated for sampled pharmacy i in step 
840 according to 

25 VTs^Vsijtl+Sw^] (12) 

where V S y is the actual prescription product sales volume for physician j at pharmacy i, 
Wmp are the weighting factor for sampled pharmacy Sij and £ is the summation over all 
weighting factors associated with the pharmacy Sij . [1+Swmp ] is a projection factor for a 
physician's prescription at the sampled pharmacy Vsy. 

30 After the estimated volume signal is formed for the range i=l to il in 

the work station processor 230-1, step 855-1 is entered from step 850-1 wherein the 
total volume for the range from i=l to i=il by summing the estimated volumes for the 
pharmacies Sij. The work station processor 230-N operates in similar manner from 
step 838-N to 855-N to generate a total estimated volume for the range from iN to 
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imax. Work station processors are interconnected by the network 250 such as an 
ethemet or token ring arrangement so that signals from one work station processor 
that are required for the formation of the VT-rsij signal in another work station are 
transferred The total volume signals for the ranges are merged in step 860 and the 

5 merged signals are transferred to the processor 215 via the transfer store 228. The 
resulting estimated total volume signal V T j for the physician j is then formed in the 
processor 215 (step 865) and a confidence signal for the estimated total volume Vr } is 
generated in step 870 as described with respect to the flow chart of FIG. 8. 

The foregoing merely illustrates the principles of the invention. 

10 Various modifications and alterations to the described embodiments will be apparent 
to those skilled in the art in view of the teachings herein. It will thus be appreciated 
that those skilled in the art will be able to devise numerous techniques which, 
although not explicitly described herein, embody the principles of the invention and 
are thus within the spirit and scope of the invention. 
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